Perfect $L_p$ Sampling in a Data Stream


Abstract

In this paper, we resolve the one-pass space complexity of perfect $L_p$ sampling for $p \in (0,2)$ in a stream. Given a stream of updates (insertions and deletions) to the coordinates of an underl...


Related articles

An Hybrid Data Stream Summarizing Approach by Sampling and Clustering

Computer systems generate a large amount of data that, in terms of space and time, is very expensive or even impossible to store. Besides this, many applications need to keep a historical view of such data in order to provide historical aggregated information, perform data mining tasks, or detect anomalous behavior in the computer systems. One solution is to treat the data as streams that can be p...


Perfect sampling without a lifetime commitment

Generating perfect samples from distributions using Markov chains has a wide range of applications, from statistical physics to approximation algorithms. In perfect sampling algorithms, a sample is drawn exactly from the stationary distribution of a chain, as opposed to methods that run the chain "for a long time" and create samples drawn from a distribution that is close to the stationary dis...


Continuous Monitoring of l_p Norms in Data Streams

In insertion-only streaming, one sees a sequence of indices $a_1, a_2, \ldots, a_m \in [n]$. The stream defines a sequence of $m$ frequency vectors $x^{(1)}, \ldots, x^{(m)} \in \mathbb{R}^n$ with $(x^{(t)})_i \stackrel{\mathrm{def}}{=} |\{j : j \in [t],\ a_j = i\}|$. That is, $x^{(t)}$ is the frequency vector after seeing the first $t$ items in the stream. Much work in the streaming literature focuses on estimating some function $f(x^{(m)})$. Many applications though require obtain...


Detecting Concept Drift in Data Stream Using Semi-Supervised Classification

Data stream is a sequence of data generated from various information sources at a high speed and high volume. Classifying data streams faces the three challenges of unlimited length, online processing, and concept drift. In related research, to meet the challenge of unlimited stream length, commonly the stream is divided into fixed size windows or gradual forgetting is used. Concept drift refer...


Antithetic Coupling for Perfect Sampling

This paper reports some initial investigations of the use of antithetic variates in perfect sampling. A simple random walk example is presented to illustrate the key ingredients of antithetic coupling for perfect sampling as well as its potential benefit. A key step in implementing antithetic coupling is to generate random variates that are negatively associated, a stronger condition than negat...



Journal

Journal title: SIAM Journal on Computing

Year: 2021

ISSN: 1095-7111, 0097-5397

DOI: https://doi.org/10.1137/18m1229912